Effects of Easy Hybrid Parallelization with CUDA for Numerical-Atomic-Orbital Density Functional Theory Calculation
نویسندگان
چکیده
We modified a MPI-friendly density functional theory (DFT) source code within hybrid parallelization including CUDA. Our objective is to find out how simple conversions within the hybrid parallelization with mid-range GPUs affect DFT code not originally suitable to CUDA. We settled several rules of hybrid parallelization for numerical-atomic-orbital (NAO) DFT codes. The test was performed on a magnetite material system with OpenMX code by utilizing a hardware system containing 2 Xeon E5606 CPUs and 2 Quadro 4000 GPUs. 3-way hybrid routines obtained a speedup of 7.55 while 2-way hybrid speedup by 10.94. GPUs with CUDA complement the efficiency of OpenMP and compensate CPUs’ excessive competition within MPI. ∗Electronic mail: [email protected] 1 ar X iv :1 40 2. 42 47 v1 [ cs .D C ] 1 8 Fe b 20 14
منابع مشابه
Effects of Easy Hybrid Parallelization with CUDA for OpenMX
A MPI-friendly density functional theory (DFT) source code was modified within hybrid parallelization including CUDA. The objective is to find out how simple conversions within the hybrid parallelization with mid-range GPUs affect DFT code not originally suitable to CUDA. Several rules of hybrid parallelization for numerical-atomic-orbital (NAO) DFT codes were settled. The test was performed on...
متن کاملElectronic Structure Investigation of Octahedral Complex and Nano ring by NBO Analysis: An EPR Study
To calculation non-bonded interaction of the [CoCl6]3- complex embedded in nano ring, we focus on the single wall boron-nitride B18N18 nano ring. Thus, the geometry of B18N18 nano ring has been optimized by B3LYP method with EPR-II (Electron paramagnetic resonance) basis set and geometry of the [CoCl6]3- complex has been optimized at B3LYP method with Aldrich’s VTZ basis set and Stuttgart RSC 1...
متن کاملElectronic Structure Investigation of Octahedral Complex and Nano ring by NBO Analysis: An EPR Study
To calculation non-bonded interaction of the [CoCl6]3- complex embedded in nano ring, we focus on the single wall boron-nitride B18N18 nano ring. Thus, the geometry of B18N18 nano ring has been optimized by B3LYP method with EPR-II (Electron paramagnetic resonance) basis set and geometry of the [CoCl6]3- complex has been optimized at B3LYP method with Aldrich’s VTZ basis set and Stuttgart RSC 1...
متن کاملAdsorption of Vitamin C on a Fullerene Surface: DFT Studies
Density functional theory (DFT) calculations were performed to investigate adsorptions of vitamin C (Vit) on the surface a fullerene structure (Ful) in gaseous and water–solvated systems. Two models of Vit including OVit and MVit were created based on the original structure of Vit for OVit and methylation of all hydroxyl groups for MVit. All singular and hybrid structures were optimized and the...
متن کاملAn Ab initio and chemical shielding tensors calculations for Nucleotide 5’-Monophosphates in the Gas phase
Structural and magnetic properties of purine and pyrimidine nucleotides (CMP, UMP, dTMP, AMP, GMP, IMP) were studied at different levels of ab initio molecular orbital theory. These calculations were performed at the hartree-fock level and density functional B3LYP methods. Geometries were fully optimized by following Cs symmetry restrictions. The standard 6-31G** basis set which includes polari...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1402.4247 شماره
صفحات -
تاریخ انتشار 2014